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We present draft genome sequences of 15 clinical Vibrio isolates of various serogroups. These are valuable data for use in study- 
ing Vibrio cholerae genetic diversity, epidemic potential, and strain attribution. 
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I lihrio cholerae is a Gram-negative bacterium and the etiologic 
Ir agent of cholera. Cholera is endemic in regions with poor 
sanitation and can be a significant problem wherever local infra- 
structure is disrupted due to natural or human-caused disasters. A 
recent bulletin of the World Health Organization reported that 
there are an estimated 2.8 million cases and 91,000 deaths annu- 
ally due to cholera, with the highest burden being in children 
under the age of 5 years ( 1 ). 

V. cholerae strains can be classified into over 200 serogroups 
on the basis of differences in the surface-expressed O antigen. 
Typically, cholera epidemics are associated with specific sero- 
groups, namely Ol and 0139, although the recent events in 
Haiti demonstrate an interesting exception (2). It is currently 
not known why specific serogroups are associated with epi- 
demic potential even though many other serogroups carry the 
major virulence factors. 

Fifteen clinical isolates of various serogroups from the Saka- 
zaki type strain collection were draft sequenced using the 
Roche/454 Titanium sequencer (see Table 1). Although all 15 
are patient isolates, 13 of them belong to non-01/non-0139 
serogroups. 

Genomic DNA was extracted using the Wizard kit (Pro- 
mega, Madison, WI). 454 Titanium libraries were prepared and 
emulsion PGR (emPGR) was performed per the manufactur- 
er's instructions, prior to multiplexed sequencing and de novo 
assembly. On average, over 500,000 sequencing reads were pro- 
duced per strain, resulting in genome coverage ranging from 
19.4- to 58.4-fold depth. 

Preliminary analyses indicate that some of the major 
V. cholerae virulence factors are present in a subset of these 
genomes. For instance, we identified sequences corresponding 
to all or a large part of V. cholerae pathogenicity island 1 (VPI- 
1), not only in NIH41 and 63-93 (Ol and 0139, respectively), 
but also in nonepidemic strains 981-75, 1421-77, 571-88, 523- 



80, and 234-93. We identified sequences with homology to the 
cholera toxin genes in NIH-4 1 and 63-93, as expected, as well as 
in 571-88, 234-93, and 490-93. Finally, we identified sequences 
corresponding to all or part of VPI-2 in NIH-41 and 981-75. 
Given that these are draft genomes and in some cases gaps exist 
within these particular regions, further investigation is re- 
quired to elucidate whether these virulence factors are func- 
tional. These draft genomes have the potential to aid our un- 
derstanding of virulence and epidemicity in V. cholerae. These 
data contribute to broadening the reference genome collection 
available for studies aimed at V. cholerae genetic diversity and 
strain attribution. 

Nucleotide sequence accession numbers. The sequence data 
were deposited in GenBank under project accession numbers 
listed in Table 1. 

TABLE 1 Listing of strains with serogroups, original country of 
isolation, and GenBank accession numbers 

Isolation 

Strain Serogroup country Accession no. 

NIH41 Ol India IIDHOOOOOOOO 

5473-62 031 Philippines IIDIOOOOOOOO 

1311-69 035 India JIDJOOOOOOOO 

133-73 048 India JIDKOOOOOOOO 

1157-74 053 India JIDLOOOOOOOO 

981-75 065 India JIDMOOOOOOOO 

8-76 077 India JIDNOOOOOOOO 

1421-77 O80 India IMBLOOOOOOOO 

984-81 089 India IMBMOOOOOOOO 

571-88 O105 China JIDOOOOOOOOO 

523-80" 0115 United States JIDPOOOOOOOO 

63-93 0139 India IMBNOOOOOOOO 

234-93 0141 India IMBOOOOOOOOO 

254-93 0144 India IMBPOOOOOOOO 

490-93 0155 Thailand JIDQOOOOOOOO 
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